Salipiger mucosus Martínez-Cánovas et al. 2004 is the type species of the genus Salipiger, a 
moderately halophilic and exopolysaccharide-producing representative of the Roseobacter 
lineage within the alphaproteobacterial family Rhodobacteraceae. Members of this family 
were shown to be the most abundant bacteria especially in coastal and polar waters, but 
were also found in microbial mats and sediments. Here we describe the features of S. 
mucosus strain DSM 16094T together with its genome sequence and annotation. The 
5,689,389-bp genome sequence consists of one chromosome and several extrachromosomal 
elements. It contains 5,650 protein-coding genes and 95 RNA genes. The genome of S. 
mucosus DSM 16094T was sequenced as part of the activities of the Transregional Collaborative 
Research Center 51 (TRR51) funded by the German Research Foundation (DFG). 



Introduction 

The Roseobacter clade is a very heterogeneous 
group of marine Alphaproteobacteria that plays an 
important role in the global carbon cycle and oth- 
er biogeochemical processes [1]. Members of this 
group form an allegedly monophyletic, physiologi- 
cally heterogeneous, as well as metabolically ver- 
satile group of bacterioplankton [1]. They are 
known to live in the open ocean, especially in 
coastal areas, where they have been found many 
times in symbiosis with algae, in microbial mats, 
sediments, or associated with invertebrates, but 
representatives of this lineage were also isolated 
from marine environments like polar waters or 
sea ice [1-4], which is also presented and reflected 
by their genome sequences [2]. Whereas some 
members of the Roseobacter clade contain the 
pigment bacteriochlorophyll a and are capable of 
aerobic anoxygenic photophosphorylation, other 
members were found to transform 
dimethylsulfonylpropionate into dimethylsulfide 
[4-6]. 

Some representatives of the Roseobacter lineage 
such as Salipiger mucosus A3T are also known to 
be moderate halophiles, which are adapted to a 
wide range of salinities and were found to produce 
special compounds like compatible solutes, 
halophilic enzymes or exopolysaccharides [7-9]. 

Strain A3T (= DSM 16094T = LMG 22090T = CECT 
5855T) represents the type strain of S. mucosus 
(initially proposed as 'S. muscescens') in the monotypic 
genus Salipiger [10] and was isolated from 
saline soil bordering a saltern on the Mediterranean 
Sea coast at Calblanque (Spain) [7]. The genus 
name Salipiger was derived from the Latin noun 
sal, salis ('salt') and the Latin adjective piger ('lazy') 
[10]. The species epithet mucosus refers to the 
Latin adjective mucosus ('slimy, mucous') [10]. 
Current PubMed records do not indicate any follow-up 
research with strain A3T after the initial 
description of S. mucosus [7] and the characterization 
of its exopolysaccharide [11]. 

In this study we analyzed the genome sequence of 
S. mucosus DSM 16094T, which was selected for 
sequencing under the auspices of the German Research 
Foundation (DFG) Transregio-SFB51 
Roseobacter grant because of its phylogenetic position 
[12] and was also a candidate for the Genomic 
Encyclopedia of Bacteria and Archaea [13]. 
We present a description of the genomic sequencing 
and annotation, together with a summary classification 
and a set of features for strain 
DSM 16094T, including novel aspects of its phenotype. 

Classification and features 
16S rRNA gene analysis 

The single genomic 16S rRNA gene sequence of S. 
mucosus DSM 16094T was compared with the 
Greengenes database for determining the 
weighted relative frequencies of taxa and (trun- 
cated) keywords as previously described [14]. The 
most frequently occurring genera were Salipiger 
(21.2%), Pelagibaca (17.1%), Roseovarius 
(17.0%), Marinovum (13.1%) and Roseobacter 
(9.5%) (30 hits in total). Regarding the single hit 
to sequences from members of the species, the av- 
erage identity within high scoring pairs (HSPs) 
was 100.0%, whereas the average coverage by 
HSPs was 97.2%. Regarding the two hits to se- 
quences from other members of the genus, the av- 
erage identity within HSPs was 98.4%, whereas 
the average coverage by HSPs was 99.0%. Among 
all other species, the one yielding the highest 
score was 'Salipiger bermudensis' (DQ178660), 
which corresponded to an identity of 96.8% and an 
HSP coverage of 99.9%. (Note that the Greengenes 
database uses the INSDC (= EMBL/NCBI/DDBJ) 
annotation, which is not an authoritative source 
for nomenclature or classification). The highest- 
scoring environmental sequence was AB302369 
(Greengenes short name 'Hydrocarbon-Degrading 
Indonesian Seawater seawater isolate B44-2B44-2 
str. B44-2'), which showed an identity of 97.4% 
and an HSP coverage of 95.4%. The most frequently 
occurring keywords within the labels of all 
environmental samples that yielded hits were 'aquat, 
rank' (4.9%), 'microbi' (3.7%), 'harbour, newport' 
(3.3%), 'water' (2.7%) and 'seawat' (2.4%) (219 
hits in total), which is in line with the habitat from 
which strain A3T was isolated. Environmental 
samples that yielded hits of a higher score than 
the highest scoring species were not found. 
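
The two quantities reported above, identity within HSPs and coverage by HSPs, can be illustrated with a minimal sketch. The exact definitions used in [14] are not given in this text, so the pooling of HSPs, the `HSP` record and the example numbers below are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class HSP:
    """One high-scoring pair (hypothetical record layout)."""
    query_start: int  # 1-based, inclusive
    query_end: int    # 1-based, inclusive
    identities: int   # identical aligned positions
    align_len: int    # alignment length, gaps included


def hsp_identity(hsps):
    """Percent identity pooled over all HSPs of one hit."""
    matches = sum(h.identities for h in hsps)
    length = sum(h.align_len for h in hsps)
    return 100.0 * matches / length


def hsp_coverage(hsps, query_len):
    """Percent of query positions covered by at least one HSP."""
    covered = set()
    for h in hsps:
        covered.update(range(h.query_start, h.query_end + 1))
    return 100.0 * len(covered) / query_len


# Invented example: two overlapping HSPs on a 1,400-nt query.
example = [HSP(1, 900, 880, 900), HSP(851, 1400, 540, 550)]
identity = hsp_identity(example)
coverage = hsp_coverage(example, query_len=1400)
```

Counting covered positions as a set avoids counting the overlap between the two HSPs twice.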

Figure 1 shows the phylogenetic neighborhood of 
S. mucosus strain DSM 16094T in a 16S rRNA gene 
sequence based tree. The sequence of the single 
16S rRNA gene copy in the genome does not differ 
from the previously published 16S rRNA gene sequence 
(AY527274), which contains two ambiguous 
base calls. 

Morphology and physiology 

Cells of strain A3T are pleomorphic and stain 
Gram-negative (Figure 2). They are 1 µm in width 
and 2.0-2.5 µm in length. Motility was not observed. 
They live a strictly aerobic and chemoheterotrophic 
lifestyle. Colonies grown on MY solid 
medium are circular, convex, cream-colored and 
mucoid, whereas in liquid medium their growth is 
uniform. Cells are encapsulated. They are moderately 
halophilic, and capable of growth in a mixture 
of sea salts from 0.5 to 20% (w/v), whereas 
the optimum is between 3 and 6% (w/v) sea salts. 
When using NaCl instead of sea salts, optimal 
growth occurs at a salt concentration of 9-10% 
(w/v). Cells of strain A3T grow within a temperature 
range of 20-40°C and at a pH range between 6 
and 10. Growth does not occur under anaerobic 
conditions either by fermentation, fumarate or nitrate 
reduction or photoheterotrophy. Cells are cytochrome 
oxidase and catalase positive. 
Polyhydroxyalkanoates (PHA) are stored as reserve 
material within the cells. H2S is produced 
from L-cysteine. Selenite reduction, gluconate oxidation 
and phosphatase were observed. Urea and 
Tween 20 are hydrolyzed. A variety of tested organic 
compounds were neither metabolized nor 
sustained growth; for details see [7]. 

The utilization of carbon compounds by S. 
mucosus DSM 16094T grown at 28°C was also determined 
for this study using Generation-III 
microplates in an OmniLog phenotyping device 
(BIOLOG Inc., Hayward, CA, USA). The microplates 
were inoculated with a cell suspension at a cell 
density of 95-96% turbidity and dye IF-A. Further 
additives were vitamins, micronutrient and sea- 
salt solutions, which had to be added for dealing 
with such marine bacteria [29]. The plates were 
sealed with parafilm to avoid a loss of fluid. The 
exported measurement data were further ana- 
lyzed with the opm package for R [30,31], using its 
functionality for statistically estimating parame- 
ters from the respiration curves such as the max- 
imum height, and automatically translating these 
values into negative, ambiguous, and positive re- 
actions. The reactions were recorded in three in- 
dividual biological replicates. 
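
The translation of curve parameters into negative, ambiguous and positive reactions can be sketched as follows. This is not the procedure implemented in the opm package; the fixed cutoffs `neg_upper` and `pos_lower`, the function names and the rule for combining replicates are hypothetical and serve only to make the idea concrete.

```python
def classify_reaction(max_height, neg_upper=100.0, pos_lower=150.0):
    """Map a maximum curve height to a three-level call.

    The two cutoffs are hypothetical placeholders, not values from opm.
    """
    if max_height < neg_upper:
        return "negative"
    if max_height >= pos_lower:
        return "positive"
    return "ambiguous"


def consensus_call(max_heights, **cutoffs):
    """Combine replicate measurements of one well.

    Assumed rule: a call is accepted only if all replicates agree,
    otherwise the well is reported as ambiguous.
    """
    calls = {classify_reaction(h, **cutoffs) for h in max_heights}
    return calls.pop() if len(calls) == 1 else "ambiguous"


# Invented heights for three replicates of two wells.
well_a = consensus_call([210.0, 198.5, 225.0])
well_b = consensus_call([210.0, 60.0, 225.0])
```

Requiring agreement across all three replicates is a conservative choice; a majority vote would be an equally defensible alternative.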

[Figure 1: image of the 16S rRNA gene tree; tip labels and collapsed groups not reproduced here.] 


Figure 1. Phylogenetic tree highlighting the position of S. mucosus relative to the type strains of the type species of the 
other genera within the family Rhodobacteraceae. The tree was inferred from 1,331 aligned characters of the 16S 
rRNA gene sequence under the maximum likelihood (ML) criterion as previously described [14]. Rooting was done 
initially using the midpoint method [15] and then checked for its agreement with the current classification (Table 1). 
The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches 
are support values from 650 ML bootstrap replicates (left) and from 1,000 maximum-parsimony bootstrap replicates 
(right) if larger than 60% [14]. (That is, the backbone of the tree is largely unresolved.) Lineages with type strain 
genome sequencing projects registered in GOLD [16] are labeled with one asterisk, those also listed as 'Complete and 
Published' with two asterisks [3,17]. 



The strain was positive for pH 6, 1% NaCl, 4% 
NaCl, 8% NaCl, D-galactose, 3-O-methyl-D-glucose, 
D-fucose, L-fucose, L-rhamnose, 1% sodium lactate, 
myo-inositol, rifamycin SV, L-aspartic acid, 
L-glutamic acid, L-histidine, L-serine, D-glucuronic 
acid, quinic acid, L-lactic acid, citric acid, 
α-keto-glutaric acid, D-malic acid, L-malic acid, nalidixic 
acid, sodium formate and the positive control. 

No reactions could be detected for the negative 
control, dextrin, D-maltose, D-trehalose, 
D-cellobiose, β-gentiobiose, sucrose, D-turanose, 
stachyose, pH 5, D-raffinose, α-D-lactose, 
D-melibiose, β-methyl-D-galactoside, D-salicin, 
N-acetyl-D-glucosamine, N-acetyl-β-D-mannosamine, 
N-acetyl-D-galactosamine, N-acetyl-neuraminic 
acid, D-glucose, D-mannose, D-fructose, 
inosine, fusidic acid, D-serine, D-sorbitol, 
D-mannitol, D-arabitol, glycerol, D-glucose-6-phosphate, 
D-fructose-6-phosphate, D-aspartic acid, 
D-serine, troleandomycin, minocycline, gelatin, 
glycyl-L-proline, L-alanine, L-arginine, 
L-pyroglutamic acid, lincomycin, guanidine hydrochloride, 
niaproof, pectin, D-galacturonic acid, 
L-galactonic acid-γ-lactone, D-gluconic acid, 
glucuronamide, mucic acid, D-saccharic acid, 
vancomycin, tetrazolium violet, tetrazolium blue, 
p-hydroxy-phenylacetic acid, methyl pyruvate, 
D-lactic acid methyl ester, bromo-succinic acid, lithium 
chloride, potassium tellurite, Tween 40, 
γ-amino-n-butyric acid, α-hydroxy-butyric acid, 
β-hydroxy-butyric acid, α-keto-butyric acid, 
acetoacetic acid, propionic acid, acetic acid, 
aztreonam, butyric acid and sodium bromate. 

Martínez-Cánovas et al. tested strain A3T for 
growth on a variety of substrates, none of which 
were utilized under the applied conditions [7]. In 
contrast to this result, OmniLog measurements 
detected respiration in nearly twenty wells, including 
several sugars and amino acids. This may 
be due to respiratory measurements being more 
sensitive than growth measurements [32]. For instance, 
the positive reactions detected only in the 
OmniLog instrument might be caused by substrates 
that were only partially metabolized. 

http://standardsingenomics.org 



An important physiological property, the 
halophilic lifestyle, could be confirmed by the 
OmniLog measurements, showing that S. mucosus 
is able to grow in up to 8% NaCl. According to [7] 
the salt tolerance of this strain exceeds 10% NaCl. 




Figure 2. Phase-contrast micrograph of S. mucosus DSM 16094T 



Chemotaxonomy 

The principal cellular fatty acids of strain A3T are 
C18:1 ω7c (78.0%), C16:0 (12.4%), C12:1 3-OH (2.3%), 
C19:0 cyclo ω8c (2.3%), C18:0 (2.0%) and C16:1 ω7c and/or 
C15:0 iso 2-OH (1.3%). The presence of C18:1 ω7c as the 
predominant fatty acid is a feature characteristic of 
several taxa within the Alphaproteobacteria (e.g. 
Jannaschia helgolandensis DSM 14858T [33], 
Octadecabacter arcticus CIP 106731T [34], 
Ruegeria algicola ATCC 51440T [35], Sulfitobacter 
mediterraneus DSM 12244T [36] and Staleya 
guttiformis DSM 14443T [37]). 

The only detected respiratory lipoquinone was 
ubiquinone 10, which is a well-known characteris- 
tic of alphaproteobacterial representatives (all da- 
ta from [7]). 
Table 1. Classification and general features of S. mucosus DSM 16094T according to the MIGS recommendations 
[18] (published by the Genome Standards Consortium [19]). 

MIGS ID     Property                Term                                      Evidence code 
            Domain                  Bacteria                                  TAS [20] 
            Phylum                  Proteobacteria                            TAS [21] 
            Class                   Alphaproteobacteria                       TAS [22,23] 
            Order                   Rhodobacterales                           TAS [23,24] 
            Family                  Rhodobacteraceae                          TAS [25] 
            Genus                   Salipiger                                 TAS [7] 
            Species                 Salipiger mucosus                         TAS [7,26] 
            Strain                  A3T                                       TAS [7] 
            Gram stain              negative                                  TAS [7] 
            Cell shape              rod-shaped                                TAS [7] 
            Motility                non-motile                                TAS [7] 
            Sporulation             not reported 
            Temperature range       20-40°C                                   TAS [7] 
            Optimum temperature     30°C                                      NAS 
            Salinity                0.5-20% (Sea Salts)                       TAS [7] 
MIGS-22     Oxygen requirement      strictly aerobic                          TAS [7] 
            Carbon source           complex (e.g., yeast extract, peptone)    TAS [7] 
            Energy metabolism       chemoheterotroph                          TAS [7] 
MIGS-6      Habitat                 hypersaline soil                          TAS [7] 
MIGS-15     Biotic relationship     free living                               TAS [7] 
MIGS-14     Pathogenicity           none                                      NAS 
            Biosafety level         1                                         TAS [27] 
MIGS-23.1   Isolation               hypersaline soil bordering a saltern      TAS [7] 
MIGS-4      Geographic location     Calblanque, Murcia (southeastern Spain)   TAS [7] 
MIGS-5      Sample collection time  1998                                      NAS 
MIGS-4.1    Latitude                37.64                                     NAS 
MIGS-4.2    Longitude               -0.77                                     NAS 
MIGS-4.3    Depth                   not reported 
MIGS-4.4    Altitude                not reported 


Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non- 
traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally 
accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology 
project [28]. 



Genome sequencing and annotation 

Genome project history 

The genome of S. mucosus DSM 16094T was sequenced 
as a part of the DFG funded project 
TRR51 "Ecology, Physiology and Molecular Biology 
of the Roseobacter clade: Towards a Systems 
Biology Understanding of a Globally Important 
Clade of Marine Bacteria". The strain was chosen 
for genome sequencing according to the Genomic 
Encyclopedia of Bacteria and Archaea (GEBA) criteria 
[12,13]. Project information can be found in the 
Genomes OnLine Database [16]. The genome sequence 
is deposited in GenBank and the Integrated 
Microbial Genomes database (IMG) [38]. A 
summary of the project information is shown in 
Table 2. 
Table 2. Genome sequencing project information 

MIGS ID     Property                    Term 
MIGS-31     Finishing quality           Non-contiguous finished 
MIGS-28     Libraries used              Two genomic libraries: one Illumina PE library (420 bp 
                                        insert size), one 454 PE library (3 kb insert size) 
MIGS-29     Sequencing platforms        Illumina GA IIx, Illumina MiSeq, 454 GS-FLX+ Titanium 
MIGS-31.2   Sequencing coverage         430× 
MIGS-30     Assemblers                  Velvet version 1.1.36, Newbler version 2.3, Consed 20.0 
MIGS-32     Gene calling method         Prodigal 1.4 
            INSDC ID                    APVH00000000 
            GenBank Date of Release     July 31, 2013 
            GOLD ID                     Gi0042373 
            NCBI project ID             188761 
            Database: IMG               2523231081 
MIGS-13     Source material identifier  DSM 16094 
            Project relevance           Tree of Life, biodiversity 



Growth conditions and DNA isolation 

A culture of S. mucosus DSM 16094T was grown 
aerobically in DSMZ medium 512 [39] by adding 
2.5% NaCl at a temperature of 30°C. Genomic DNA 
was isolated using Jetflex Genomic DNA Purifica- 
tion Kit (GENOMED 600100) following the stand- 
ard protocol provided by the manufacturer but 
modified by an incubation time of 60 min, the in- 
cubation on ice overnight on a shaker, the use of 
additional 50 µl proteinase K, and the addition of 
100 µl protein precipitation buffer. The DNA is 
available from the Leibniz-Institute DSMZ through 
the DNA Bank Network [40]. 

Genome sequencing and assembly 

The genome was sequenced using a combination 
of two genomic libraries (Table 2). Sequencing 
and assembly were performed according to the 
protocol established for the genome of R. 
thermophilum DSM 16684T [17] with the following 
additional step. To achieve longer reads, the 
Illumina library was sequenced in one direction 
for 300 cycles, providing another 15.0 million 
reads. The hybrid assembly consisted of 
14,800,324 filtered Illumina reads with a median 
length of 201 bp. Pyrosequencing resulted in 
53,566 reads with an average read length of 308 
bp. After manual editing, the final assembly was 
composed of 84 contigs organized in 30 scaffolds. 
The combined sequences provided a 430× genome 
coverage. 



Genome annotation 

Genome annotation was carried out using the JGI 
genome annotation pipeline as previously described 
[17]. 

Genome properties 

The genome statistics are provided in Table 3 and 
Figure 3. The genome of strain DSM 16094T has a 
total length of 5,689,389 bp and a G+C content of 
67.1%. Of the 5,745 genes predicted, 5,650 were 
protein-coding genes and 95 were RNA genes. The majority 
of the genes (76.0%) were assigned 
a putative function while the remaining 
ones were annotated as hypothetical proteins. The 
distribution of genes into COGs functional categories 
is presented in Table 4. 
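
As a small worked check, several of the percentages reported in Table 3 can be recomputed from the raw counts. The variable and function names below are illustrative; the numbers are taken from Table 3.

```python
# Raw values as reported in Table 3.
GENOME_SIZE_BP = 5_689_389
CODING_REGION_BP = 5_064_332
GC_BP = 3_815_108
TOTAL_GENES = 5_745
RNA_GENES = 95
TRNA_GENES = 83
PROTEIN_CODING_GENES = 5_650
GENES_WITH_FUNCTION = 4_365


def percent(part, whole):
    """Share of `part` in `whole`, in percent, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


table3_percentages = {
    "DNA coding region": percent(CODING_REGION_BP, GENOME_SIZE_BP),
    "DNA G+C content": percent(GC_BP, GENOME_SIZE_BP),
    "RNA genes": percent(RNA_GENES, TOTAL_GENES),
    "tRNA genes": percent(TRNA_GENES, TOTAL_GENES),
    "Protein-coding genes": percent(PROTEIN_CODING_GENES, TOTAL_GENES),
    "Genes with function prediction": percent(GENES_WITH_FUNCTION, TOTAL_GENES),
}

for attribute, value in table3_percentages.items():
    print(f"{attribute}: {value:.2f}%")
```

Base-pair attributes are divided by the genome size and gene counts by the total number of genes, which reproduces the corresponding entries of Table 3.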

Insights into the genome 
Plasmids and phages 

Genome sequencing of S. mucosus DSM 16094T resulted 
in 30 scaffolds. In the species description, it 
was reported that this strain contains at least seven 
plasmids (550, 467, 184, 140.8, 110.6, 98.2 and 
30.8 kb) [7]. However, the identification of plasmids 
in the genome was difficult because typical 
replication modules comprising the characteristic 
replicase and the adjacent parAB partitioning operon 
are missing [41]. Nevertheless, comprehensive 
BLASTP searches with plasmid replicases 
from Rhodobacterales revealed the presence of 
three RepA and two RepB genes, whereas 
RepABC-type and DnaA-like replicases were absent 
from the genome sequence. General genomic 
features of the chromosome and four putative 
extrachromosomal replicons are listed in Table 5, 
whereas locus tags of the replicases and the large 
virB4 and virD4 genes of type IV secretion systems 
are presented in Table 6. The localization of the 
chromosomal replication initiator DnaA documents 
that (at least) scaffold 3 represents the 
chromosome. The 350 kb scaffold 5 contains a 
RepA-a type replicase [42] and a characteristic 
type IV secretion system (T4SS) comprising the 
relaxase VirD2 and the coupling protein VirD4 as 
well as the complete virB gene cluster for the 
transmembrane channel (Table 6) [43]. It probably 
represents a mobilizable extrachromosomal 
element. Scaffolds 18 and 28 contain RepA-b and 
RepB-I type replicases, respectively (Table 5), and 
may represent two additional plasmids of this 
species. However, the presence of plasmid 
replicases does not unequivocally correlate with 
extrachromosomal elements, as these genes may 
also reflect inactivated orphans or pseudogenes. 
Thus, the total number of S. mucosus plasmids 
cannot be exactly determined based on the draft 
genome sequence. 

Table 3. Genome Statistics 




Attribute                                   Value       % of total 
Genome size (bp)                            5,689,389   100.00 
DNA coding region (bp)                      5,064,332   89.01 
DNA G+C content (bp)                        3,815,108   67.06 
Number of scaffolds                         30 
Total genes                                 5,745       100.00 
RNA genes                                   95          1.65 
rRNA operons                                4 
tRNA genes                                  83          1.44 
Protein-coding genes                        5,650       98.35 
Genes with function prediction (proteins)   4,365       75.98 
Genes in paralog clusters                   4,666       81.22 
Genes assigned to COGs                      4,191       72.95 
Genes assigned Pfam domains                 4,465       77.91 
Genes with signal peptides                  512         8.91 
Genes with transmembrane helices            1,183       20.59 
CRISPR repeats                              1 










Figure 3. Graphical map of the largest scaffold. From bottom to the top: Genes on forward strand (colored by 
COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs 
red), GC content (black), GC skew (purple/olive). 
Table 4. Number of genes associated with the 25 general COG functional categories 

Code   Value   %age   Description 
J      175     3.8    Translation, ribosomal structure and biogenesis 
A      0       0.0    RNA processing and modification 
K      350     7.5    Transcription 
L      228     4.9    Replication, recombination and repair 
B      4       0.1    Chromatin structure and dynamics 
D      37      0.8    Cell cycle control, cell division, chromosome partitioning 
Y      0       0.0    Nuclear structure 
V      47      1.0    Defense mechanisms 
T      179     3.8    Signal transduction mechanisms 
M      289     6.2    Cell wall/membrane/envelope biogenesis 
N      96      2.1    Cell motility 
Z      1       0.0    Cytoskeleton 
W      0       0.0    Extracellular structures 
U      84      1.8    Intracellular trafficking and secretion, and vesicular transport 
O      141     3.0    Posttranslational modification, protein turnover, chaperones 
C      289     6.2    Energy production and conversion 
G      362     7.8    Carbohydrate transport and metabolism 
E      507     10.9   Amino acid transport and metabolism 
F      95      2.0    Nucleotide transport and metabolism 
H      174     3.7    Coenzyme transport and metabolism 
I      172     3.7    Lipid transport and metabolism 
P      253     5.4    Inorganic ion transport and metabolism 
Q      180     3.9    Secondary metabolites biosynthesis, transport and catabolism 
R      583     12.5   General function prediction only 
S      414     8.9    Function unknown 
-      1,554   27.1   Not in COGs 

The total is based on the total number of protein coding genes in the annotated genome. 

Table 5. General genomic features of the chromosome and putative extrachromosomal replicons from S. mucosus 
DSM 16094T 

Replicon     Scaffold   Replicase         Length (bp)   GC (%)   Topology   No. Genes^ 
Chromosome   3          DnaA              550,914       68       linear**   522 
Plasmid 1    5          RepA-a            349,771       67       linear**   353 
Plasmid 2*   18         RepA-b            45,096        59       linear**   42 
Plasmid 3*   28         RepB-I            15,330        70       linear**   11 
Plasmid 4*   2          RepA-c, RepB-II   702,451       62       linear**   709 

^ Number of genes deduced from automatic annotation. 

*assignment uncertain. 

**circularity not experimentally validated. 
Table 6. Integrated Microbial Genome (IMG) locus tags of S. mucosus DSM 16094T genes for the initiation of 
replication and type IV secretion systems (T4SS) required for conjugation 

             Replication initiation       Type IV Secretion 
Replicon     Replicase   Locus Tag        VirB4          VirD4 
Chromosome   DnaA        salmuc_02154 
Plasmid 1    RepA-a      salmuc_03210     salmuc_02871   salmuc_03216 
Plasmid 2*   RepA-b      salmuc_05477 
Plasmid 3*   RepB-I      salmuc_05734 
Plasmid 4*   RepA-c      salmuc_01514 
             RepB-II     salmuc_01780 

*assignment uncertain. 

A potential fourth plasmid is represented by the 
large 702 kb scaffold 2 that contains a RepA-c as 
well as a RepB-II replicase (salmuc_01514; 
salmuc_01780). However, the presence of typical 
CRISPRs representing the defense system against 
phage attacks [44] favors a chromosomal affilia- 
tion for scaffold 2. 

The genome sequence of S. mucosus DSM 16094T 
reveals that this strain must encounter continuous 
attack by phages. Regions of genes related to 
prophages are found at several sites throughout 
the genome (e.g., salmuc_02795 - 02809 and 
salmuc_02619 - 02632). Several genes encoding 
cas proteins (salmuc_01330, salmuc_01331 and 
01333) indicate that a CRISPR defense system is 
functional in this strain. The large number of 
phage-related genes integrated into the genome 
could mediate frequent rearrangements of the 
DNA structure in this strain. Furthermore, this 
could indicate a possible exchange of genes with 
other species attacked by similar phages. 

Morphological traits reflected in the genome 

Analysis of the genome sequence of S. mucosus 
DSM 16094T revealed the presence of a high number 
of genes associated with putative production 
and biosynthesis of exopolysaccharides 
(salmuc_00030, salmuc_00724, salmuc_01174, 
salmuc_01693, salmuc_02911, salmuc_3919, 
salmuc_04853 and salmuc_05511). This finding is 
in accord with a recent study by Llamas and colleagues, 
who characterized the exopolysaccharides 
produced by strain A3T in detail [11]. 
Interestingly, genes putatively associated with cellulose 
synthesis (salmuc_02978 and 
salmuc_02979) were also found. 

Surprisingly, many genes involved in flagellar motility 
(e.g., salmuc_02151 - salmuc_02191, 
salmuc_04184 - 04236) and chemotaxis (e.g., 
salmuc_03613 - 03617) were observed, although 
this strain was described as non-motile in the 
original species description [7]. 

Genes associated with the synthesis of poly- 
hydroxy-alkanoates as storage compound (e.g., 
salmuc_03738, salmuc_03739 and salmuc_05206) 
as well as phasin (salmuc_03343) were also found. 



Nutrient limitation 

Many saline environments, e.g., the central oceans, 
are characterized by a limitation of the essential 
nutrients iron and phosphorus. S. mucosus seems 
to have several mechanisms to overcome 
growth limitation caused by depletion of both elements. 
Iron is mainly acquired by the synthesis of 
siderophores and transported into the cell in its 
chelated form. Genes encoding ABC transporters 
for siderophores of the hydroxamate type 
(salmuc_02461 - 02463 and salmuc_02667 - 
02669), as well as for hemin-bound iron 
(salmuc_00710 - 00712) were found. To satisfy 
the need for phosphorus, strain DSM 16094T 
appears able to mobilize organic phosphonates as alternatives 
to phosphate, which is indicated by a continuous 
array of 22 genes (salmuc_00786 - 00807) 
involved in the uptake and utilization of 
phosphonates. 
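
Locus-tag ranges such as "salmuc_00786 - 00807" recur throughout this section. The helper below is a minimal sketch for expanding such a range into individual tags; it assumes that tags within a range are numbered consecutively without gaps, which the text does not state for every region, and the function name is hypothetical.

```python
import re


def expand_locus_range(text, prefix="salmuc_"):
    """Expand a range such as 'salmuc_00786 - 00807' into single locus tags.

    Assumes consecutive five-digit numbering within the range.
    """
    numbers = re.findall(r"\d{5}", text)
    if len(numbers) != 2:
        raise ValueError(f"expected two five-digit numbers in {text!r}")
    start, end = int(numbers[0]), int(numbers[1])
    if end < start:
        raise ValueError("range end precedes range start")
    return [f"{prefix}{n:05d}" for n in range(start, end + 1)]


# The phosphonate gene array described in the text.
phosphonate_genes = expand_locus_range("salmuc_00786 - 00807")
print(len(phosphonate_genes))
```

For the phosphonate array the expansion yields 22 tags, in agreement with the 22 genes mentioned in the text.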

Metabolic plasticity 

In contrast to the published description of S. 
mucosus [7], which suggests a strictly aerobic and 
chemoheterotrophic metabolism, the genome reveals 
an astonishing metabolic versatility. Besides 
genes for the degradation of organic substrates, 
we also found genes encoding enzymes for the utilization 
of alternative electron donors enabling 
facultative lithotrophic growth: a Sox 
multienzyme complex encoded by the genes 
salmuc_00587 - 00597 could be utilized for the oxidation 
of thiosulfate to sulfate, while molecular 
hydrogen may be utilized as electron donor by a 
multimeric uptake hydrogenase of the [NiFe]-type 
(salmuc_04814 - 04830). A further potential substrate 
is carbon monoxide, which might be oxidized 
by an aerobic-type carbon monoxide dehydrogenase 
encoded by the genes salmuc_05576 - 
05578. Additional genes encoding subunits of carbon 
monoxide dehydrogenase were found dispersed 
at several sites in the genome. The metabolic 
plasticity of this species is further reflected 
in a multiply branched electron transport chain. 
The cascade starts from a NADH dehydrogenase 
(salmuc_03065 - 03088) or succinate dehydrogenase 
complex (salmuc_00519 - 00521), where 
ubiquinone is reduced to ubiquinol. Electrons can 
either be transferred from ubiquinol via a terminal 
cytochrome bd ubiquinol oxidase 
(salmuc_05386 - 05387) directly to oxygen, or 
transferred to a cytochrome bc1 complex reducing 
cytochromes. Reduced cytochromes can then interact 
with terminal oxidases reducing oxygen. 
Genes for at least two different cytochrome c oxidases 
were detected, being either of a putative 
caa3- (salmuc_05284 - 05285) or cbb3-type 
(salmuc_00548 - 00551). The chemiosmotic gradient 
generated in the electron transport chain 
can be used for the synthesis of ATP by an ATP 
synthase complex of the F0F1 type (salmuc_01101 
- 01110). According to the genome sequence there 
is also the possibility that nitrate could be used as 
alternative electron acceptor in the absence of oxygen. 
In addition to a periplasmic nitrate 
reductase of the Nap-type (salmuc_04127 - 
04129), genes for a copper-containing 
dissimilatory nitrite reductase (salmuc_05547), a 
nitric oxide reductase (salmuc_05554 and 
salmuc_05555) and a nitrous oxide reductase 
(salmuc_04123) were detected, resulting in a 
complete pathway for denitrification of nitrate to 
molecular nitrogen. 
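
The statement that the four reductases form a complete denitrification pathway can be made explicit with a small data structure. The sketch below lists the steps and locus tags named in the text and checks that each product is the substrate of the next step; the names `DENITRIFICATION_STEPS` and `pathway_is_complete` are illustrative, and the check concerns only gene presence as annotated, not enzyme activity.

```python
# (substrate, product, enzyme, locus tags as given in the text)
DENITRIFICATION_STEPS = [
    ("nitrate", "nitrite",
     "periplasmic nitrate reductase (Nap-type)", "salmuc_04127 - 04129"),
    ("nitrite", "nitric oxide",
     "copper-containing nitrite reductase", "salmuc_05547"),
    ("nitric oxide", "nitrous oxide",
     "nitric oxide reductase", "salmuc_05554 and salmuc_05555"),
    ("nitrous oxide", "dinitrogen",
     "nitrous oxide reductase", "salmuc_04123"),
]


def pathway_is_complete(steps, start, end):
    """Return True if the steps chain from `start` to `end` without a gap."""
    current = start
    for substrate, product, _enzyme, _loci in steps:
        if substrate != current:
            return False
        current = product
    return current == end


complete = pathway_is_complete(DENITRIFICATION_STEPS, "nitrate", "dinitrogen")

# Dropping one step (here nitric oxide reductase) breaks the chain.
without_nor = DENITRIFICATION_STEPS[:2] + DENITRIFICATION_STEPS[3:]
broken = pathway_is_complete(without_nor, "nitrate", "dinitrogen")
```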

Interestingly, the genome sequence of S. mucosus 
DSM 16094T further revealed the presence of a 
high number of genes associated with putative 
photoautotrophy. In addition to a photosynthesis gene 
cluster (salmuc_05125 - 05164), RuBisCO-associated 
genes (salmuc_03532 - 03534) involved 
in the fixation of CO2 via the Calvin-Benson cycle 
(salmuc_03531 - 03539) were observed. The 
presence of such genes suggests that photoautotrophic 
growth may be possible under certain conditions. The 
genome of this strain also encodes a blue light-activated 
photosensor (BLUF, salmuc_00318) that 
may play a role in the light-dependent regulation 
of photosynthesis genes. It is tempting to speculate 
that a genetic inventory allowing 
photoautotrophy reflects an evolutionary position 
at the root of the Roseobacter clade. Several members 
of this lineage are known to be capable of an 
aerobic photoheterotrophic metabolism, whereas 
photoautotrophic growth has not been reported 
yet. By analogy with the scenario proposed for the 
evolution of aerobic photoheterotrophic 
Gammaproteobacteria [45,46], representatives of the 
Roseobacter clade may have lost genes for CO2 fixation 
following adaptation to aerobic environments 
characterized by electron donor limitation, 
thereby preventing utilization of the Calvin-Benson 
cycle, which demands an abundant supply 
of reducing power and energy. 

However, none of the novel metabolic traits, 
which are predicted based on the genome se- 
quence, could be verified experimentally in our 
laboratory so far. One explanation may be that 
under unfavorable growth conditions, e.g. 
anaerobiosis, lysogenic phages become activated, 
so that growth does not become apparent. 